skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Bao, Han"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. SUMMARY The metabolism of tetrahydrofolate (H4PteGlun)‐bound one‐carbon (C1) units (C1metabolism) is multifaceted and required for plant growth, but it is unclear what of many possible synthesis pathways provide C1units in specific organelles and tissues. One possible source of C1units is via formate‐tetrahydrofolate ligase, which catalyzes the reversible ATP‐driven production of 10‐formyltetrahydrofolate (10‐formyl‐H4PteGlun) from formate and tetrahydrofolate (H4PteGlun). Here, we report biochemical and functional characterization of the enzyme fromArabidopsis thaliana(AtFTHFL). We show that the recombinant AtFTHFL has lowerKmandkcatvalues with pentaglutamyl tetrahydrofolate (H4PteGlu5) as compared to monoglutamyl tetrahydrofolate (H4PteGlu1), resulting in virtually identical catalytic efficiencies for the two substrates. Stable transformation ofArabidopsisplants with the EGFP‐tagged AtFTHFL, followed with fluorescence microscopy, demonstrated cytosolic signal. Two independent T‐DNA insertion lines with impaired AtFTHFL function had shorter roots compared to the wild type plants, demonstrating the importance of this enzyme for root growth. Overexpressing AtFTHFL led to the accumulation of H4PteGlun + 5,10‐methylene‐H4PteGlunand serine, accompanied with the depletion of formate and glycolate, in roots of the transgenicArabidopsisplants. This metabolic adjustment supports the hypothesis that AtFTHFL feeds the cytosolic C1network in roots with C1units originating from glycolate, and that these units are then used mainly for biosynthesis of serine, and not as much for the biosynthesis of 5‐methyl‐H4PteGlun, methionine, andS‐adenosylmethionine. This finding has implications for any future attempts to engineer one‐carbon unit‐requiring products through manipulation of the one‐carbon metabolic network in non‐photosynthetic organs. 
    more » « less
  2. When dealing with data from distinct locations, machine learning algorithms tend to demonstrate an implicit preference of some locations over the others, which constitutes biases that sabotage the spatial fairness of the algorithm. This unfairness can easily introduce biases in subsequent decision-making given broad adoptions of learning-based solutions in practice. However, locational biases in AI are largely understudied. To mitigate biases over locations, we propose a locational meta-referee (Meta-Ref) to oversee the few-shot meta-training and meta-testing of a deep neural network. Meta-Ref dynamically adjusts the learning rates for training samples of given locations to advocate a fair performance across locations, through an explicit consideration of locational biases and the characteristics of input data. We present a three-phase training framework to learn both a meta-learning-based predictor and an integrated Meta-Ref that governs the fairness of the model. Once trained with a distribution of spatial tasks, Meta-Ref is applied to samples from new spatial tasks (i.e., regions outside the training area) to promote fairness during the fine-tune step. We carried out experiments with two case studies on crop monitoring and transportation safety, which show Meta-Ref can improve locational fairness while keeping the overall prediction quality at a similar level. 
    more » « less
  3. Cloud masking is both a fundamental and a critical task in the vast majority of Earth observation problems across social sectors, including agriculture, energy, water, etc. The sheer volume of satellite imagery to be processed has fast-climbed to a scale (e.g., >10 PBs/year) that is prohibitive for manual processing. Meanwhile, generating reliable cloud masks and image composite is increasingly challenging due to the continued distribution-shifts in the imagery collected by existing sensors and the ever-growing variety of sensors and platforms. Moreover, labeled samples are scarce and geographically limited compared to the needs in real large-scale applications. In related work, traditional remote sensing methods are often physics-based and rely on special spectral signatures from multi- or hyper-spectral bands, which are often not available in data collected by many -- and especially more recent -- high-resolution platforms. Machine learning and deep learning based methods, on the other hand, often require large volumes of up-to-date training data to be reliable and generalizable over space. We propose an autonomous image composition and masking (Auto-CM) framework to learn to solve the fundamental tasks in a label-free manner, by leveraging different dynamics of events in both geographic domains and time-series. Our experiments show that Auto-CM outperforms existing methods on a wide-range of data with different satellite platforms, geographic regions and bands. 
    more » « less
  4. Estimating human mobility responses to the large-scale spreading of the COVID-19 pandemic is crucial, since its significance guides policymakers to give Non-pharmaceutical Interventions, such as closure or reopening of businesses. It is challenging to model due to complex social contexts and limited training data. Recently, we proposed a conditional generative adversarial network (COVID-GAN) to estimate human mobility response under a set of social and policy conditions integrated from multiple data sources. Although COVID-GAN achieves a good average estimation accuracy under real-world conditions, it produces higher errors in certain regions due to the presence of spatial heterogeneity and outliers. To address these issues, in this article, we extend our prior work by introducing a new spatio-temporal deep generative model, namely, COVID-GAN+. COVID-GAN+ deals with the spatial heterogeneity issue by introducing a new spatial feature layer that utilizes the local Moran statistic to model the spatial heterogeneity strength in the data. In addition, we redesign the training objective to learn the estimated mobility changes from historical average levels to mitigate the effects of spatial outliers. We perform comprehensive evaluations using urban mobility data derived from cell phone records and census data. Results show that COVID-GAN+ can better approximate real-world human mobility responses than prior methods, including COVID-GAN. 
    more » « less